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ABSTRACT 

We explore the distribution of mass about the expected sites of galaxy formation in a high-resolution 
hydrodynamical simulation of a ACDM cosmology which includes cooling, star-formation and feedback. 
We show that the evolution of the galaxy bias is non-trivial and non-monotonic and that the bias 
is stochastic. We discuss the galaxy-mass cross-correlation function and highlight problems with the 
interpretation of galaxy-galaxy lensing as due to extended dark matter halos around "typical" galaxies. 
Finally we show that the "external shear" around strong gravitational lenses is likely to be closely 
aligned with the direction of the nearest massive galaxy and to have a power-law distribution which can 
be predicted from the galaxy auto-correlation function. 

Subject headings: cosmology: theory - large-scale structure of Universe 



1. INTRODUCTION 

The distribution of galaxies with respect to the total 
mass in the Universe remains a central unsolved prob- 
lem in cosmology, though one which is becoming increas- 
ingly amenable to theoretical modeling and ever more con- 
strained observationally. In particular, gravitational lens- 
ing allows us to probe the spatial distribution of the mass 
in the Universe, for example around galaxies and in clus- 
ters of galaxies. In this paper we present predictions for 
the distribution of mass around sites of galaxy formation 
using a hydrodynamic simulation of structure formation 
which includes cooling, star-formation and feedback. We 
believe such simulations can provide ab initio and robust 
predictions for the statistical distribution of the sites of 
galaxy formation, even though the detailed star forma- 
tion rates and galaxy morphologies are sensitive to uncer- 
tain "sub-resolution" physics. As such, these simulations 
can be used to predict, from first principles, the galaxy 
auto-correlation function, the galaxy-mass cross correla- 
tion function (which is the key ingredient in galaxy-galaxy 
lensing studies) and the mass auto-correlation function. 

Originally regarded as an interesting, but not practically 
useful, prediction of Einstein's theory of general relativity, 
gravitational lensing has become a "standard" tool in ob- 
servational cosmology. Strong gravitational lensing has 
a long history, and weak lensing of background galaxies 
by clusters of galaxies is now well established. Just re- 
cently the weak lensing of galaxies by large-scale structure 
ha s been observation ally d emonstrated (van Waerbeke et 
al. |2000|; Bacon et al. |2000|; Kaiser et al. |2000|; W ittman et 



Maoli et al. 



Rhodes et al. 2001), at about 



the level predicted by theories based on gravitational in- 
stability in a cold dark matter (CDM) dominated universe. 
The weak lensing observations probe the (projected) dis- 
tribution of mass in the Universe directly, without refer- 
ence to the distribution of light. This provides us with di- 
rect constraints on the dark matter auto-correlation func- 
tion, for which theoretical predictions are very well de- 
veloped both analytically and numerically on Mpc scales. 
By contrast, galaxy redshift surveys can give us detailed 
three-dimensional information about the distribution of 
luminous matter, providing us with a measure of the 



galaxy auto-correlation function and its evolution. How- 
ever, galaxy formation is still poorly understood, mak- 
ing interpretation of the full information contained in the 
galaxy auto-correlation function difficult. 

Intermediate between these two are dark matter-galaxy 
cross-correlations. These can be measured whenever the 
galaxy distribution is cross-correlated with a tracer of the 
dark matter. Some examples are galaxy-galaxy lensing 
(correlating galaxies and weak lensing shear) , foreground- 
background galaxy correlations and galaxy-QSO correla- 
tions (correlating galaxies with weak lensing magnifica- 
tion). All of these can be interpreted as a projection of the 
galaxy-mass correlation function which we shall discuss. 
To understand the galaxy-mass cross-correlation requires 
us to understand the environments of galaxies, but not 
necessarily the detailed properties of the galaxies them- 
selves. 

Another probe of the mass distribution around galax- 
ies comes from strong gravitational lensing. Although 
the central regions of the primary lens galaxy, whose spa- 
tial structure is not well modeled by current calculations, 
dominates the monopole gravity in a strong lens, the 
quadrupole (and to a lesser extent the higher poles) has 
a significant contribution from the external shear or tidal 
gravity near the lens or along the line-of-sight. All lens- 



ing models require this degr ee of f reedom (Barkana 1996 
Keeton, Kochanek & Seljak 1997 ) and in some cases the 
dominant source can be clearly identified with a nearby 
galaxy. 

The outline of this paper is as follows: we review the 
simulation we shall use in §0, giving details of how we 
identify galaxies and their parent halos. The evolution of 
clustering in the simulation is discussed in §g and we in- 
troduce our primary tool, the correlation function in §0. 
The implications of our simulation for the interpretation 
of galaxy-galaxy lensing is dealt with in §|[ Models of ex- 
ternal shear for strong lensing are dealt with in §@. Finally 
in W\ we summarize our findings. 

2. THE SIMULATION 

Throughout, we s hall u se a new simulation of the Os- 
triker & Steinhardt (1995) concordance model, which has 
O m = 0.3, Q A = 0.7, H Q = lOO/ikms^Mpc" 1 with 



h = 0.67, Ob = 0.04, n — 1 and a 8 = 0.9 (correspond- 
ing to 5h — 5.02 x 10~ 5 ). This model yields a reasonable 
fit to the current suite of cosmological constraints and as 
such provides a good framework for making realistic pre- 
dictions. 

We have used th e Tr ee/SPH code Gadget (Springel, 
Yoshida & White ^00l|) to run a 2 x 300 3 = 54 million 
particle simulation of this model in a periodic box of size 
33.5 /i Mpc. Equal numbers of gas and dark matter par- 
ticles were employed, so Wdark = 1.0 x 10 8 ft. _1 M Q and 
Tigas = 1-5 x 10 /i —1 Mq. The gravitational interaction 
between particles is softened on smal l scales using a cu- 
bic spline (e.g. Hernquist & Katz 198£) and the 'Plummer 
equivalent' gravitational softening in our simulation was 
6/i~ 1 kpc, fixed in comoving coordinates. The simulation 
was started at redshift z = 99 and evolved to z = 0. Unfor- 
tunately the simulation box is too small to reliably predict 
clustering properties at very low redshifts, as the funda- 
mental mode becomes increasingly non-linear for z < 1. 
Hence, in this paper we restrict our analysis to redshifts 
z > 0.5. 

In addition to the gravitational interactions and adia- 
batic hydrodynamics, the code follows radiative cooling 
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Fig. 1. — The projected mass density in the vicinity of the largest 
group in the simulation at z = 0.7. The grey scale indicates the 
projected gas density (in h~ 1 Mq/(/i — 1 Mpc) 2 ) within a cube of side 
3h~ Mpc centered on the most bound particle in the group. Small 
knots of dense gas, which we identify as sites of galaxy formation, 
can be seen clearly. Like many large groups, this one is part of a 
large filamentary structure, which explains the spatial orientation 
of the distribution of sub-halos. 



and heating processes in the presence of a UV radiation 
field in essentially the same way as described in Katz, 
Weinberg & Hernquist ( 1996] ). We model the UV radi- 



ation field using a modified Haardt & Madau ( 1996 ) spec- 
trum with reionization occuring at z rs 6 (see e.g. Dave 
et al. 1999), and with an amplitude chosen to reproduce 
the mean opacity of the Lyman-alpha forest at z — 3 (e.g. 
Rauch et al. 1997). Star formation (and feedback) is han- 
dled using a mod ification of the "multi-pha se" m odel of 
Yepes et al. ( |1997D and Hultman & Pharasyn ( |1999| ). Each 
SPH particle is assumed to describe a co-spatial fluid of 
ambient hot gas, condensed cold clouds, and stars. Hydro- 
dynamics is followed for only the hot gas phase, but the 
cold gas and stars are subject to gravity, add inertia, and 
participate in mass and energy exchange processes with 
the ambient gas phase. The algorithm will be described in 
more detail in a forthcoming paper. 

From the simulation outputs we have constructed cat- 
alogues of halos and their sub-halos using t he alg orithm 
Subfind described in detail in Springel et al. ( |2000 ). First, 
the Friends-of- Friends algorithm (with a linking length of 
0.15-h^ 1 ^ 3 ) is used to define a parent halo catalogue, and 
then bound sub-halos within each parent are identified. 
Sub-halos are defined as locally overdense, gravitation- 
ally bound structures. These sub-halos typically consist 
of cold, dense gas at their center, surrounded by a halo 
of dark matter and tenuous hot gas. The halo may be 
severely truncated if the sub-halo is not the central galaxy 
in the parent halo, but the dense gas that has been able 
to efficiently cool and form stars should allow a reliable 
identification with galaxies in the real Universe, at least 
statistically (see Fig. [l]) . 

We have used a linking length of 0.15 rather than the 
more canonical 0.2 in defining the parent halos, because 
we found for the larger linking length two neighboring but 
distinct halos were frequently linked into one parent halo, 
with one of them then identified as a sub-halo of the other. 
While this problem is not eliminated entirely using a link- 
ing length of 0.15, it is significantly reduced. Only very 
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Table 1 

The number of sub-halos found in the simulation as a 

function of redshift. the first column is the total 

number of sub-halos with more than 32 particles. 

The SECOND COLUMN is the number of sub-halos with 

M500 > 10 10 h~ 1 M <3 . Third, the number of sub-halos 

WHICH LIVE IN A PARENT "FOF" HALO WITH 

M500 > 10 10 h~ 1 M e . The last column lists the 

fraction of sub-halos above 10 10 h~ 1 m q which are 

"isolated", i.e. are the only members of their parent 

HALO. 



close halo pairs or triplets, possibly in the process of merg- 
ing, are joined with this linking length. In some instances 
the FOF algorithm finds halos which are not bound; these 
halos have in general very small particle number and are 
not included in the analysis. Results from the halo finding 
are shown in Table [l]. 

For each halo or sub-halo we define the "center" as the 
position of the particle with the minimum potential en- 
ergy, with the potential calculated using only the group 
particles. This definition usually corresponds very closely 
to the most bound and densest particles, and is more ro- 
bust than the center of mass. Many definitions of mass are 
possible an d can differ from each other quite significantly 
(e.g. White 2001). Although counting the particles within 
the FOF group is the simplest, we have chosen to follow a 
more commonly used approach where the mass is defined 
as that enclosed within a radius inside of which the mean 
density is 500 times the critical density. An alternative 
definition could use the background density rather than 
the critical density. The two scale differently with redshift 
except for cosmologies with critical matter density. At the 
redshifts at which we are working the density contrast with 
respect to background scales as [fi m + £1a(1 + .z)~ 3 ] . 

In Fig. g we show a scatter plot of the masses of the 
sub-halos and their parent halos at z = 1. Most of the ha- 
los are isolated and have parent halo masses fractionally 
larger than the sub-halo masses. As the parent halos be- 
come more massive there is less chance that it hosts only 
one galaxy. In our first pass through we found that there 
are a very small number of systems where the sub-halo 
mass exceeds the parent halo mass. This can arise in situ- 
ations where several halos are merging, with the bridging 
material being at low density. If the most bound particle 
in the group lies at the center of a less massive sub-halo our 
definition of mass, M500, returns the mass of this sub-halo 
as the "parent" mass. In such cases we replace the "par- 
ent" mass with the sum of the sub-halo masses to more 
accurately reflect the total mass of the system. 

Ideally, we would identify the sub-halos as galaxies by 
some observational property, such as luminosity or color. 
Unfortunately, it is difficult to reliably compute such prop- 
erties from this simulation, because the outputs are too in- 
frequent to graft on population synthesis codes. However 
we do give, in addition to the total mass of a sub-halo, 

log lo (M 5OO /M ) 
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Table 2 

Fraction of sub-halos which are isolated as a 
function of mass and redshift. there are no halos in 

which are 
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Table 3 

The (mass weighted) mean redshift of star 
formation as a function of redshift. 
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Fig. 2. — A scatter plot of M500 for the sub-halos and their parent 
FOF halos from the simulation at z = 1. 



THE SIMULATION WITH M500 > 10 12 h 1 Mq 
ISOLATED FOR Z > 0.5! 



results as a function of the stellar mass within rsoo of the 
sub-halo center. We do this under the assumption that 
the near infrared magnitude of a gal axy sh ould roughly 
track its stellar mass (e.g. Cole et al. 2001 ). The (mass 
weighted) mean age of the stars at each output is given in 
Table [| to help with this conversion. Finally, to help in 
matching our halos to observed galaxies we give the num- 
ber density of our halos, as a function of z, in Fig. g. As 
a first approximation, and in the absence of further infor- 
mation, one can match halos of a given mass to objects of 
an equivalent space density. 

3. THE CLUSTERING OF SUB-HALOS 

We are interested in the clustering of galaxies and dark 
matter, and its evolution with time, within this simulation. 
We show in Fig. the dark mattern power spectra at z = 4, 
3, 2 and 1. To compute these power spectra we estimated 
the dark matter density field from the pa rticles using NGP 
assignment (Hockney & Eastwood 1988) onto a 256 3 grid. 
The density field was then Fourier Transformed to obtain 
5k- The power spectrum was obtained by binning \5k\ in 
shells in fc-space, corrected for the binning, the mass as- 
signme nt on to the grid and shot-noise (see e.g. Peacock & 
Dodds 1996 for a discussion of some of these issues). To 



extend the calculation to higher k we repeated the process 
several times, rescaling the particle separations by increas- 
ing factors and remapping the distribution into the peri- 
odic volume e ach t ime (Peacock, private communication; 



Jenkins et al. 1998 ). This provides us with an estimate of 



P{k) limited only by the resolution of the simulation, and 
not by the size of the Fourier transform grid. 

Compared to semi-an alytic estimates of the expected 
power (Peacock & Dodds 1996 ) we find that this box has a 
slight shortfall. To check whether the semi-analytic model 
correctly estimates the dark matter power spectrum for 
this model we have run two additional dark matter only 
simulations, each using 256 3 particles. The first simulation 
is in a 200/i _1 Mpc box and the second in a 100ft. _1 Mpc 
box. The power spectra computed from the 2 = 1 outputs 
of these runs in the same manner as above are also shown 
in Fig. [|. We see that the fitting function has slightly less 
power than the simulation at small sca les, a s seen also in 
the simulations of Jain, Mo & White ( 1995| ). The power 
spectrum in our 33.5ft. _1 Mpc box is low on all scales at the 
redshifts of interest, although the shape is approximately 
correct. Fluctuations in the amplitude, but not the shape, 
of the power spectrum due to finite volume effects are well 
known (e.g. Meiksin & White 1999 ). Part of the shortfall 
in power is due to the particular random phases chosen in 
the initial conditions, the remainder is due to the fact that 
the box is becoming less and less of a fair sample of the 
Universe as time evolves and the non- linear scale increases. 
We show the latter effect by plotting, in Fig. || the growth 
of the fundamental mode, A 2 (fc/), compared with linear 
theory. As long as the box remains a fair sample of the 
Universe the ratio A 2 (kf)/D 2 (z), where D(z) is the linear 
growth factor, should remain c onstan t. 

In earlier work (White et al. [2001 ) we showed that the 
number of "galaxies" per dark matter halo scaled approx- 

x The power spectrum of the gas mass follows the dark matter 
power spectrum to k > lOftMpc" 1 . The stellar particles are more 
strongly clustered as expected. 
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Fig. 3. — The mass function for the simulation at z = 2 (top), 1 
(middle) and 0.5 (bottom). Solid symbols are for the "parent" halos 
found with FOF with a linking length of 0.15. Open symbols count 
halos identified by the sub-halo finder, removing unbound particles. 
We show the mass function as a function of total mass (squares), 
gas mass (circles) and stellar mass (triangles). In all cases the mass 
quoted is M500 in h _1 Mq (see text). 



imately as a power-law in the parent halo mass with an 
index less than 1 (typically 0.7-0.8). This behavior is pre- 
cisely what is needed to explain the ob served clustering of 
galaxies (e.g. Scoccimarro et al. 2000 ) and is also seen in 
semi-analytic models (e.g. Seljak (2000 ). The spatial dis- 
tribution of sub-halos in the simulation is consistent with 
the assumption that every halo hosts a galaxy which re- 
sides at its center and any extra satellite galaxies trace the 
dark matter distribution. It remains an open question ob- 
servationally whether the galaxy densit y traces the mass 
density in halos (e.g. Adami et al. 2001). 

We performed a counts-in-cells analysis of both the mass 
and galaxy number density fields as a function of redshift. 
First, we divided the cubical box up into a grid of cells 
with N c = 32, 64, 128 or 256 cells on a side. We as- 
signed each particle or galaxy to the appropriate cubical 
cell weighting the particles by their mass and the galax- 
ies equally. For the galaxies, we treated them as points 
at the position of their potential minima. This was done 
8 times with the grid shifted to a random position each 
time. Then, we calculated the variance of mass and galaxy 
fluctuations, from which we can define b = cr gg /<J mm , the 
covariance b' = cr^/a^m and the cross-correlation coeffi- 
cient r = <Jg m /u gg a mm . Our results are shown in Table |[ 
We see a clear tendency for the bias to be "stochastic", 
with decreasing correlation between the galaxies and the 
total mass as we probe smaller length scales. 

4. CORRELATION FUNCTION 

We identify the center of a "galaxy" as the position of 
the minimum of the potential of a bound sub-halo found 
with Subfind. Around each such center we calculate the 
probability, in excess of random, of having mass M within 



0.5 




Fig. 5. — The power in the fundamental mode vs linear theory. 
We plot the ratio of A 2 (kf) to the linear growth factor, D(z). This 
should be constant as long as the box is a fair sample of the Universe. 
The power begins to fall below the linear theory prediction at late 
times because our periodic box is missing long-wavelength power on 
scales larger than the fundamental mode, which are not available to 
be coupled into the evolution as kf goes non-linear. 
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Fig. 4. — The dimensionless dark matter power spectrum as a 
function of redshift. The solid lines indicate the expected non-linear 
power spectrum fo rJibia, model, as estimated using the formalism of 
Peacock & Dodds (1996) at z = 1, 2, 3 and 4 (top to bottom). The 
solid symbols are the power spectra at z = 1 from two 256 3 DM only 
simulations in boxes of size 200h~ x Mpc (squares) and 100h~ iJVTpc 
(triangles) which provide and estimates of the uncertainty in the 
Peacock & Dodds formalism. The open circles indicate the power 
spectra from the simulation in this work, the 33.5/i — 1 Mpc hydro 
simulation, at z = 1, 2, 3 and 4 (top to bottom). 
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Table 4 

counts-in-cells of the mass and galaxy number 

density fields at z — 1. the first column is the 

number of cells in each dimension of the box, the 

second column gives the rms fluctuation in the mass 

(in units of the mean), the third the rms fluctuation 

in the galaxy number density (using only galaxies 

with m500 > 10 10 h~ 1 m q ) and the final column gives 

THE CROSS CORRELATION COEFFICIENT. We HAVE STOPPED 

AT N c = 256 SINCE TO IMPLEMENT FINER GRIDS WE WOULD 

NEED TO USE THE EXTENT OF THE SUB-HALOS RATHER THAN 

JUST THEIR POSITIONS. 



a spherical shell of radius r and width dr. Stellar mass, 
cold and hot gas mass and dark matter are all included 
in this accounting. We present our results in terms of the 
correlation function which we define as 



4- 



M(r; dr) 
(M(r; dr)) 



f, 



(1) 



where M(r; dr) is the mass contained within the shell be- 
tween radius r and r + dr and (■ • ■) indicates an average 
quantity. To safely avoid numerical resolution effects we 
shall consider galaxies only above 10 10 h~ 1 M & . 

We show in Fig. |] the correlation function of the mass, 
£mmM> the galaxies above 10 10 h~ 1 M ( 7 ) , £, gg (r), and the 
galaxy-mass cross correlation, £ gm (r), at z = 1, 2 and 
3. The correlation functions are well described by power 
laws (with a slowly changing slope) and on scales above a 
fewx 100/i _1 kpc the galaxy-mass cross correlation function 
is just the geometric mean of the mass-mass and galaxy- 
galaxy autocorrelation functions (see Fig. 0). While the 
mass correlation function grows steadily with time the 
evolution of the galaxy correlation function is more com- 
plicated. We give the correlation length, ro, defined as 
£( r o) = 1, vs. redshift in Table ||. Note that on length 
scales approaching l/i _1 Mpc our box is missing power (as 
described above) and so these correlation lengths are bi- 
ased low. These results are broadly consistent with the 
clustering prope rties o f "ga laxies" in simulations reported 
by Katz et al. (1992, 1999) once differences in mass res- 
olution, box size and galaxy identification are taken into 
account. 

Finally, we make a distinction between two types of 
galaxies, those that are the sole resident of a dark matter 
halo ( "isolated galaxies" ) and those which are members of 
a larger halo containing several sub-halos (see Tables |l|, ||) . 
As expected the more massive galaxies reside in the more 
massive parent halos which host more than one sub-halo 
and so the isolated fraction decreases with mass. Most 
of the lower mass sub-halos are isolated. As we shall see 
below, we expect the traditional interpretation of galaxy- 
galaxy lensing to be more correct for the isolated galaxies, 
which in our case means those of low mass. 

5. GALAXY-GALAXY LENSING 
The observational study of gal axy-galaxy lensing has a 



long history (Tyson et al. 1984) although only recently 



have definitive detect ions been made by sev eral g roups 
(Brainerd et al |1996| ; Dell Antonio & Tyson |1996) ; Grif- 
fiths et al. |1996l; Hudson et al. |1998|; Natarajan etal. 11998 



1l 



Wilson et al. \ZQQOf , Fischer et al. pOOOfl . Such studies ha- 



lve 
traditionally been interpreted as constraints on extended 
dark matter halos around "typical" galaxies (see Fig. ||). 
A more modern interpretation, within the context of large- 
scale structure, is as a proj ection of the 3D galaxy-mass 
correlation function (Kaiser 1992). As with any such pro- 
jection, the effect of material along the line-of-sight but 
not physically associated with the object in questi on ca n 
be a serious one (see e.g. Metzler, White & Loken 2001). 
We shall not address this issue here, focusing instead on 
the underlying 3D correlation function itself and its inter- 
pretation. 

Similar work on galaxy-galax y len sing has already been 
presented by Guzik & Seljak (200C) using semi-analytic 
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Fig. 6. — The 2-point correlation functions of the mass (solid 
lines), galaxies with M500 > lO lo /i -1 M0 (dashed) and the galaxy- 
mass cross correlation function (dotted) at z = 1, 2 and 3. 
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Fig. 7. — The cross correlation coefficient for the parameters in 
Fig. pi The signal is noisy for high redshift and small distance, due 
to the finite number of "galaxies" and mass resolution limitations 
in the simulation. 
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Table 5 
The correlation length ro defined by, £(ro) = 1, in 

/t _1 KPC, AS A FUNCTION OF REDSHIFT. THE FIRST COLUMN 

(FIT) GIVES THE DARK MATTER AUTO-CORRELATION LENGTH 

COMPUTE D FRO M THE FITTING FUNCTION OF PEACOCK & 

DODDS (1996), THE OTHER THREE COLUMNS GIVE THE 

LENGTHS COMPUTED FROM THE SIMULATION. NOTE THE 

SHORTFALL IN POWER, DISCUSSED IN THE TEXT. THE MASS 

CORRELATION FUNCTION £ mm INCREASES MONOTONICALLY 

WITH TIME WHILE THE GALAXY AUTO-CORRELATION 
FUNCTION (FOR GALAXIES WITH M500 > 10 1 
AND GALAXY-MASS CROSS-CORRELATION, £ gm , 
COMPLICATED EVOLUTION. 
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Fig. 8. — The spherically averaged mass profile, scaled to the 
background density p = n m p cr it, for the 4 "galaxies" with M500 
within 5% of 10 12 /i _1 AfQ, extracted from the 2 = 1 output. The 
solid lines include all of the mass, the dotted lines only the baryonic 
(gas+stellar) mass. The dashed lines show NFW profiles with c = 5 
and 10 (see Eq. Q). The secondary "peaks" in the profiles are due to 
neighboring halos — as discussed in the text none of these "galaxies" 
are isolated though all are centrally located in the parent halo. 



models of galaxy formation. By comparison with the semi- 
analytic models, our simulations have better spatial and 
mass resolution and include far more physics. Unfortu- 
nately limitations on computer resources have forced us to 
simulate a relatively small volume of space. This both re- 
duces the size of our samples for statistical purposes and 
limits the minimum redshift to which we can accurately 
follow the development of large-scale structure. It is thus 
encouraging that our results are in good agreement with 
theirs in many respects. 

As a first step we therefore ask how £, gm (T) relates to 
the profile of an "average" galaxy in our simulation? As 
shown in Fig. pL the spherically averaged profiles of our 
halos are r easonably well fit by the NFW profile (Navarro 



et al. |1996| ) 



p(r) 



Pa 



x(l + x) 2 
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Fig. 9. — (top) The profile obtained from the simulation at z = 
0.5 if we interpret ^ S m(r) as a density profile through p e ff( r ) = 
p (1 + fg m (r)). The bold curve indicates the region where we have 
explcitly computed § gm from the simulation, the rest is power-law 
extrapolation. The dashed lines are NFW profiles with M VIr = 
2 X 10 11 /i _1 Mq and c = 5 and 10 (see text), (bottom) The rotation 

curve from this profile, assuming v c = WGM/r, and for the two 
NFW profiles. 



where x = r/r s is the radius scaled in units of a charac- 
teristic radius r s . The central density is fixed by specify- 
ing the halo mass. While we have worked throughout in 
terms of Af 50 o , the mass of the halo is typically defined to 
be the virial mass M = (47r/3)(5 v irA;rit?"vi r where we take 
the virial radius, r v irj as the radius within which the mean 
density enclosed is £ v ir times the critical density. Since our 
cosmology has J7 m ^ 1, the top-hat model prediction for 
5 v h is redshift dependent, taking the value i5 V i r ~ 100 at 



0, S v 



140 at 



0.5 and <L 



18n 2 



The parameter c = r v - lx /r s measures the degree of central 
concentration of this mass. 

To estimate a profile from the cross correlation we define 
p e ff(r) = /5(1 + £ gm (r)), using a power-law extrapolation 
to extend the range for both large and small radius be- 
yond what we have computed explicitly from the simula- 
tion. The results are shown in Fig. [| along with two NFW 
profiles of virial mass M140 — 1.8 x 10 11 /i _1 M0, which is 
the "average" mass of halos in the simulation at z = 0.5. 

The concept of an "average" mass is somewhat nebu- 
lous. Simply averaging M500 for the halos identified at 
z = 0.5 gives (M 500 ) = 9 x lO lo /i -1 M . Relating this to a 
virial mass is complicated by the fact that the concentra- 
tion, c, and thus the ration M v ; r /M5oo, varies wi th m ass. 
An alternative route is to use the Jenkins et al. ( |2000 ) fit 
to the mass function to compute 



(M) 



KL Mdn 
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M,, 



(111 
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above A/ cut = 1.4 x 10 10 Ii^Mq. The limiting mass is ob- 
tained by converting M500 = 10 10 h~ 1 M & to M140 for an 
NFW profile with c = 10. [We find that for z > 0.5 the 
virial mass and M200 differ by less than 10%, so one could 
alternatively use Af 2 oo throughout.] We employ this con- 
version because the virial mas s is clo ser to the "mass" def- 
inition used by Jenkins et al. ( [2000 ) than the M500 values 
preferred in this s tudy. While several steps are involved 
here, White fl200l| ) has shown that using an NFW profile 
to convert between such mass definitions works very well. 
Varying the value of the concentration parameter in our 
conversion from A/500 to A/140 makes little difference to 
our average mass. For a concentration of 5 the mass ratio 
is M140 = I.6M500 and M cut = 2x 10 11 /i _1 M . 

As we can see the NFW profiles, while providing a 
good fit to individual "galaxies" within the simulation 
(e.g. Fig. ||) do not provide a very good fit to the pro- 
file p c s(r). Also, using p@s(r) we would estimate that 
the virial mass of an "average" galaxy is A/102 = 1.0 X 
\0 ll h~ l M Q (M 500 = 5.2 x 10™ h^M®, c.f. 9 x 10 10 
above), a factor of two lower than the average computed 
above. Finally we note that the associated rotation curve 
for our "average" galaxy is much flatter than the rotation 
curves for the individual galaxies making up the average. 
This is a consequence of the varying virial radii and the 
weighting by the mass function and is not reflecting the 
distribution of matter in the individual halos. 

These results suggest that galaxy-galaxy lensing, wherein 
one increases the signal-to-noise by stacking many galax- 
ies, is not in fact measuring the profile of an "average" 
galaxy in the usual sense of these words. The main prob- 
lem is that galaxies come in a wide range of masses and 
sizes, and live in a range of environments so the naive av- 



eraging loses its significance. If it were possible to restrict 
the galaxies going into the average then a more faithful 
representation could be obtained (although projection ef- 
fects could still be a significant source of error), but we 
have no a priori way of knowing the galaxy mass. These 
issues should be borne in mind when attempting to use a 
weak lensing analysis to estimate halo profiles or mass-to- 
light ratios from galaxy-galaxy lensing. 

There are two effects which muddy the waters. First, 
not all galaxies are the sole members of their dark matter 
halos. Even for those galaxies which are "isolated" what 
one measures is the integral over the mass function of halo 
profiles. Since more massive halos are in general larger, 
there is no sense in which one m easures an "average" pro- 
file. For example, Seljak (200C) has suggested that much 



of the large-r signal in this case comes from the larger 
halos rather than the asymptotic behavior of the smaller 
halos. 
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Fig. 10. — The galaxy-mass cross correlation, §gm('") at z = 1. In 
each panel the solid line is the cross correlation including all massive 
galaxies. In the top panel, we show separately the contributions 
from "isolated" galaxies (dotted lines) and from galaxies sharing 
halos (dashed lines). In the middle panel we show the cross cor- 
relation around galaxies with total masses 10 10 /i — 1 Mq < A/500 < 
10 11 h- 1 M e (dotted), 1O 11 ^ 1 M0 < Af 5 00 < lO^r^MQ (short 
dashed), lO 12 /! -1 M < M 500 < 10 13 h~ 1 MQ (long dashed). 
The more massive sub-halos are rare, leading to a noiser sig- 
nal. In the lower panel we show the cross correlation around 
galaxies with stellar masses 10 9 h~ 1 Mq < M500 < 10 10 /i _1 Mq 
(dotted), lO lo h- 1 Af < Af 50 o < lO 11 /!" 1 M Q (short dashed), 
lO 11 ^- 1 *^© < M500 < 1O 12 /i _1 M0 (long dashed). The situation 
at higher and lower redshifts is very similar to that shown here. 



In Fig. [l(] we show that, although the "isolated" galax- 
ies are the major ity of the number counts (Table |], see 



also White et al. |2001| ), the signal for £ gm (?") is interme- 
diate between that for isolated and non-isolated groups. 
The two curves begin to merge at very large radius where 
the density is becoming close to the cosmic mean. This 
suggests that at larger radius (r ~ l/i -1 Mpc) we are prob- 
ing not the galactic mass profile but the general distribu- 
tion of matter and the isolated and non-isolated galaxies 
have similar large-r profiles. Further, the fact that £ gm dif- 
fers from the isolated halo case at smaller r indicates that 
even there one is not measuring the profile of the galactic 
halo. Galaxies which are part embedded in larger halos 
contribute disproportionately to the signal from 10/i _1 kpc 
out to several 100ft- _1 kpc. It is this "intermediate" range 
of radii wher e the signal is dominated by larger halos, as 
Seljak ( [2000 ) suggests. In either case, when one measures 
£gm (r) one is not simply measuring the mass profile around 
a galaxy located at t he ce nter of its dark matter halo. 

Guzik & Seljak ( 2000 ) have suggested that isolated 
galaxies should show stronger cross-correlation at small-r 
and fixed mass, since they are all centrally located within 
their halos. At fixed galaxy (i.e. sub-halo) mass the oppo- 
site effect is true, since the non-isolated galaxies tend to 
live in more massive halos. However, if we hold the mass of 
the parent halo rather than that of the galaxy fixed we can 
correct for this effect. We find that if we do this then the 
central galaxies are often in denser environments, in the 
sense that £ gm is larger at very small r ~ 10 /i -1 kpc. How- 
ever by r ~ 100 /i -1 kpc the opposite effect is true and the 
non-isolated galaxies have the stronger cross correlation. 
This persists until r ~ 1 /i _1 Mpc where the cross correla- 
tions for isolated and non-isolated galaxies are equal. We 
also note that the density around the peak is determined 
primarily by the mass and formation time of the sub-halo, 
not on whether it is part of a larger structure. Second, 
many of our systems contain distinct sub-halos, in which 
however the "bridging" material is at quite high density 
(indicating that it really is part of a larger structure and 
not an artifact of our group finding technique). In some 
sense all of these sub-halos are "central" and have a simi- 
lar mass profile even though they belong to a larger halo. 
This shows that the idealization of spherical parent ha- 
los is a poor approximation to the groups we see in the 
simulation. 

To understand the effect of the integration over mass we 
show the signal broken up in different ways in Fig. 1JX In 
the middle panel of Fig. JIG we break things down by the 
mass of the galaxy itself. Since there is a trend for more 
massive galaxies to be larger and to live in more massive 
parent halos, their cross-correlation is larger. Again, even 
though the vast majority of galaxies (by number) have low 
mass, the signal comes from a range of masses. We also 
break the signal down by the galaxy stellar mass, under 
the assumption that the stellar mass is roughly tracing 
the near infrared luminosity of a galaxy and the trends 
are similar to the total mass case. 

Thus it appears that the oft-stated claim that galaxy- 
galaxy lensing probes the profile of dark matter halos 
around galaxies is not true in detail. 

6. STRONG LENSING AND EXTERNAL SHEAR 



The angular structure in the gravitational field near a 
multiple-image gravitational lens arises from three compo- 
nents: the intrinsic structure of the primary lens galaxy, 
local tidal shears generated by other halos and structures 
correlated with the primary lens, and the accumulated 
weak or large-scale structure (LSS) shear along the ray be- 
tween the observer and the source. In most circumstances, 
the LSS contribution can be modeled as an addition al tida l 
shear near the primary lens (Kovner 1987] , Barkana 1996). 
Simple models for the tw o sou rces of tidal perturbations 
(Kochanek & Apostolakis 1988] ; Keeton, Kochanek & Sel- 
jak 1997] ) suggest that the shear from correlated structures 
is more important than the shear from LSS. Keeton et 
al. (1997) also found that models for all four-image lenses, 
whose geometry makes the models very sensitive to the 
angular structure of the gravitational field, show dramatic 
improvements when the model has two axes for the angu- 
lar structure of the gravitational field. These two axes pre- 
sumably arise from the major axis of the primary lens and 
the major axis o f the combined tidal shear contributions. 
Kochanek ( |200l| ) has shown that one angular component 
is clearly aligned with the primary lens galaxy, while the 
other has the amplitude expected from tidal perturbations. 

While standard weak lensing methods can be used to 
calculate the statistical properties of the LSS shear con- 
tribution, they are not well suited to estimating the con- 
tribution from structure correlated with the primary lens. 
If our simulation is correctly modeling the sites of galaxy 
formation, however, it is ideal for exploring the correla- 
tions between the shear generated by the density distri- 
bution and estimates of the shear based on the virialized 
halos which we can observe as galaxies. Here we begin 
this exploration by posing several questions, focusing on 
the properties of the correlated shear. Answers to these 
questions can be of practical use in understanding gravita- 
tional lenses from observations and models. First, what is 
the amplitude and distribution of the shear perturbations 
generated by structure correlated with the primary lens? 
Second, what is the physical scale on which the shear is 
typically generated? Third, how does the total shear cor- 
relate in direction and amplitude with the distribution of 
galaxies? 

We begin by expanding the potential, projected along 
a randomly chosen line-of-sight, in a Fourier series (the 
projection and Fourier expansion don't "commute"). Of 
particular interest is the "shear" or quadrupole moment of 
the projected potential, which we define as the coefficient 
of the R 2 term near the origin, or 
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where R is the 2D projected distance. We obtain the shear 
by rewriting the potential in terms of the lensing potential 
ip, which satisfies V 2 ip = 2S/S cr it, and then taking the 
absolute value 
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For a source at z = 1 and a lens at z = 0.5 the comoving 
critical density is 2 x 10 15 ft _1 M /(/i- 1 Mpc) 2 . Through- 
out we shall quote S C ritV'2 which can be scaled to any given 
lens redshift using Eq. (Q). 

In Eq. (Q) we need to define the region of integration. 
We are not interested in the contribution to this shear 
from the galaxy itself, which is in any case difficult for us 
to resolve. Nor are we interested in the contribution from 
uncorrelated large-scale structure along the line-of-sight. 
Thus we shall compute the shear within a shell extending 



from r min to r„ 



where r denotes a 3D distance. For 



r max we choose roughly twice the correlation length of the 
galaxy-mass cross correlation function or 2h~ 1 Mpc. In 
fact the shear converges well within this radius and the re- 
sults for r max = l/i _1 Mpc are almost identical. Since the 
integral converges rapidly the change of the chord length 
at large-r doesn't affect our results. For r min we choose 
two scales which bracket the reasonable range. At the 
low end we choose r m ; n = 50ft. _1 kpc, just outside the re- 
gion of baryon domination. At the high end we choose 
fmin = 200/i~ 1 kpc, roughly twice the virial radius. The 
amplitude of our results are quite sensitive to this choice, 
as the shear is dominated by nearby structures. Finally we 
also want to exclude matter (and galaxies) which would, 
in projection, lie close to the Einstein radius of the galaxy. 
For this reason we exclude any matter or galaxies with 
R < 10/i _1 kpc. 

Our particle-based estimator for 7 is thus 



£crit7^ 
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where 9j and Rj are the projected coordinates of particle 
j and the sum is over all particles in a shell, centered on 



the galaxy, with 



and Rj > lOh 1 kpc. 



The distribution in amplitude of the shear, £ cr ;t7, is 
shown in Fig. 0. For 7 > 10~ 2 the distribution is quite 
well fit by a power-law, with the amplitude somewhat de- 
pendent on the value of r m ; n we choose. Towards lower 
values of the shear we have a roll-off, as required when the 
Universe becomes optically thick. Comparing the ampli- 
tude of the shear generated with r m { n = 50/i _1 kpc to that 
with r m in = 200/i _1 kpc we see that much of the shear is 
generated close to the lens. The slope of the di stribution is 
close to the prediction of Keeton et al. (1997), who mod- 
elled the "external" shear as due to singular isothermal 
spheres of fixed radius (for which 7 ~ -R -1 ) distributed 
according to a power-law correlation function £(r) ~ r~ x 
with x = 7/4. In this model P(> 7) is just P(< R) 
so dP/d-y oc 7 X ~ 4 = 7~ 9 / 4 (shown as the dashed line in 
Fig.0). 

Another interesting question is how the shear computed 
using all of the mass compares to that obtained by using 
only the galaxies. We recomputed the shear above using 
all galaxies in the annulus with M500 > 10 w h~ 1 M ( ? ) . We 
find that the direction is reasonably well reproduced, with 
the cosine of twice the misalignment angle 
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galaxy. Here we find that while the shear is still strongly 
peaked near cos 2 29 = 1, it is less strongly peaked than if 
we use all of the galaxies (Fig. |l2p . A much better indica- 
tor is using the nearest massive galaxy (i.e. galaxy above 
10 12 /i _1 Af Q ) in which case the alignment is almost as good 
as using all of the galaxies in the shell. In comparing to 
observations of course it is important to include the shear 
coming from uncorrelated large-scale structure along the 
line-of-sight which may cause a swing in the angle away 
from the direction of the nearest galaxy. 

Finally, scaling our results to the same mean mass den- 
sity we find that the amplitude ratio 



11 = 
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sharply peaked near 1 (Fig. |2|). It is also of interest to 
ask how our result is changed if we take only the nearest 



has a large scatter, with values covering several decades. 
This can be understood physically by recalling that the 
galaxies roughly trace the position of mass concentrations, 
but the amount of mass in the galaxies can be only a small 
fraction of the total mass in a given halo. This fraction 
typically decreases as the halo mass increases. 

7. CONCLUSIONS 

We have presented predictions for the distribution of 
mass around sites of galaxy formation using a hydrody- 
namic simulation of structure formation which includes 
cooling, star-formation and feedback. In such simulations 
galaxies can be easily identified as dense knots of gas which 
stand out strongly from the background. Thus these sim- 
ulations can be used to predict, from first principles, the 
galaxy auto-correlation function, the galaxy-mass cross 
correlation function (which is the key ingredient in galaxy- 
galaxy lensing studies) and the mass auto-correlation func- 
tion. 

While the dark matter clustering agrees very well with 
fitting functions we find that the evolution of the galaxy 
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Fig. 11. — The distribution of E cr ; t 7 around galaxies more massive 
than 10 10 h~ 1 Mq. The solid line marks the shear computed within 
50ft _1 kpc < rj < 2/i _1 Mpc and Rj > 10/i _1 kpc and the dotted 
line 200h~ 1 kpc < rj < 2/i -1 Mpc. Varying r max makes almost no 
difference to the distribution. The vertical dashed line marks £ cr it 
for a source at 2 = 1 and the long-dashed line has slope —9/4 (see 
text). 
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'bias' is non-trivial and non-monotonic and that the bias is 
increasingly 'stochastic' to small scales. The galaxy-mass 
cross-correlation function is approximately the geometric 
mean of the galaxy-galaxy and mass-mass auto-correlation 
functions on scales above a few hundred kpc but there is 
more structure below lOOkpc. 

We find that halos in our simulation have approximately 



NFWl forms, with a large scatter about the mean profile. 
Relating the galaxy-mass cross-correlation to the profile 
of an "average" halo is however fraught with difficulties. 
Neither the mean mass, the profile or the rotation curve 
of a typical halo is well reproduced by interpreting £ gm (r) 
as a galaxy profile. This casts some doubt on the ability 
of galaxy-galaxy lensing to determine the halo properties 
or mass-to-light ratios of "typical" galaxies. 

Finally we have looked at the distribution of shear 
around galaxies which could be strong gravitational lenses. 
We found that the cosmologically relevant distribution of 
shears is well approximated by a power-law. The slope of 
this power-law is in good agreement with a model which 
assumes all galaxies are singular isothermal spheres with a 
power-law correlation function and that the shear is domi- 
nated by the nearest neighbour. The distribution rolls over 
at 7 ~ 10~ 2 with a peak roughly an order of magnitude 
below this. The direction of the shear is well reproduced 
by assuming the galaxies trace the mass, or that the shear 
is dominated by the nearest massive galaxy. The distribu- 
tion of the magnitude of the shear is however quite broad. 
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Fig. 12. — The distribution of cos 2 29, the angle between the 
shear determined using all of the mass and the shear determined 
using galaxies (weighted by their mass). The solid line gives the 
misalignment using all of the galaxies in the r m ; n = 50h~ J kpc shell 
to estimate 7 ga j and the dashed line shows the effect of increasing 
f'min to 200/i~ 1 kpc. In each case the misalignment taking just the 
closest massive (M500 > 10 12 h~ 1 Mq) galaxy is almost indistin- 
guishable. The dotted line shows the misalignment using just the 
closest galaxy above 10 10 h~ 1 Mq for the case r m j n = 50h~ x kpc. 



